首页> 外文OA文献 >Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks
【2h】

Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks

机译:估计抽样下的网络度分布:逆   问题,应用于监控社交媒体网络

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Networks are a popular tool for representing elements in a system and theirinterconnectedness. Many observed networks can be viewed as only samples ofsome true underlying network. Such is frequently the case, for example, in themonitoring and study of massive, online social networks. We study the problemof how to estimate the degree distribution - an object of fundamental interest- of a true underlying network from its sampled network. In particular, we showthat this problem can be formulated as an inverse problem. Playing a key rolein this formulation is a matrix relating the expectation of our sampled degreedistribution to the true underlying degree distribution. Under many networksampling designs, this matrix can be defined entirely in terms of the designand is found to be ill-conditioned. As a result, our inverse problem frequentlyis ill-posed. Accordingly, we offer a constrained, penalized weightedleast-squares approach to solving this problem. A Monte Carlo variant ofStein's unbiased risk estimation (SURE) is used to select the penalizationparameter. We explore the behavior of our resulting estimator of network degreedistribution in simulation, using a variety of combinations of network modelsand sampling regimes. In addition, we demonstrate the ability of our method toaccurately reconstruct the degree distributions of various sub-communitieswithin online social networks corresponding to Friendster, Orkut andLiveJournal. Overall, our results show that the true degree distributions fromboth homogeneous and inhomogeneous networks can be recovered with substantiallygreater accuracy than reflected in the empirical degree distribution resultingfrom the original sampling.
机译:网络是一种流行的工具,用于表示系统中的元素及其相互连接。可以将许多观察到的网络视为某些真实基础网络的样本。例如,在监视和研究大型在线社交网络时,经常会出现这种情况。我们研究了如何从其采样网络中估算一个真实基础网络的度数分布(一个基本兴趣的对象)的问题。特别是,我们证明了该问题可以表述为反问题。在此公式中起关键作用的是一个矩阵,该矩阵将我们的采样度分布的期望与真实的基础度分布相关联。在许多网络采样设计中,可以根据设计完全定义此矩阵,并且发现该矩阵条件不佳。结果,我们的逆问题常常是不恰当的。因此,我们提供了一种约束的,惩罚性的加权最小二乘法来解决此问题。 Stein的无偏风险估计(SURE)的Monte Carlo变体用于选择惩罚参数。我们使用网络模型和采样方式的各种组合,探索了仿真结果中网络度分布估计器的行为。另外,我们证明了我们的方法能够在与Friendster,Orkut和LiveJournal相对应的在线社交网络中准确地重建各个子社区的程度分布。总体而言,我们的结果表明,与原始采样得出的经验度分布相比,从均质和非均质网络中获得的真实度分布都可以以更高的精度恢复。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号